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Xylella fastidiosa causes bacterial leaf scorch in landscape trees, including mulberry. We determined the draft genome of the 
mulberry strain Mul-MD in order to gain a better understanding of the molecular basis of strain divergence, host specificity, nu- 
trient requirements, and pathogenicity, as well as to develop genome-based specific detection methods. 
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Xylella fastidiosa is a Gram-negative, nutritionally fastidious, 
insect-transmitted, and xylem-inhabiting bacterium that 
causes a wide range of plant diseases, including Pierce's disease of 
grapevine and bacterial leaf scorch, in landscape trees such as mul- 
berry. So far, eight X. fastidiosa genomes are available, including 
five complete genomes for the citrus variegated chlorosis strain 
9a5c (1), Pierce's disease strains Temecula 1 (2) and GB514 (3), 
and almond strains M12 and M23 (4), as well as three draft ge- 
nomes for the oleander strain Annl, the almond strain Dixon (5), 
and the X. fastidiosa biocontrol strain EB92-1 from elderberry (6). 
However, no genomes from landscape trees had been reported. 
We therefore sequenced the mulberry strain of X fastidiosa, Mul- 
MD, which was isolated in 2011 from a mulberry tree displaying 
leaf scorch symptoms in Beltsville, MD. 

Genomic DNA of X. fastidiosa strain Mul-MD was extracted 
from a triply cloned pure culture in periwinkle wilt medium (7) 
using a Blood and Tissue kit (Qiagen, Inc., Valencia, CA) accord- 
ing to the manufacturer's instructions. Random shotgun and 3-kb 
mate-pair libraries of Mul-MD were generated and sequenced us- 
ing Roche 454 GS (FLX titanium) pyrosequencing, resulting in 
137,284 shotgun reads and 426,457 mate-pair reads totaling 
254,742,009 bases, with a read-length average of about 450 bases. 
The total number of reads after processing by the Newbler Assem- 
bler from all libraries was 852,805 aligned reads, with 133,080,992 
bases aligned. Using Newbler gsAssembler v 2.6, we assembled the 
genome into 188 contigs, of which 101 were >500 bases in size, 
and 27 scaffolds. The largest contig was 395,385 bases. Among the 
large contigs, the N 50 contig size was 134,146 bases. The contigs 
have an average length of 13,528 bases and were run through the 
annotation pipeline, which uses GeneMark to predict coding re- 
gions based on prior Xylella fastidiosa gene models and runs 
BLASTX against a protein set that includes UniProt and all known 
Xylella proteins to determine edges of genes. The translated pro- 
tein sequences were processed using Interproscan v. 4.8 for func- 
tional annotation and UniProt for additional descriptive informa- 
tion. Open reading frames shorter than 150 bases were eliminated. 



Selected open reading frames from in silico analysis that were not 
consistent with annotated Xylella genes were manually anno- 
tated. tRNA and rRNA predictions were made using the latest 
tRNAscan-SE and RNAmmer servers, respectively. 

The 5X draft genome of the X fastidiosa strain Mul-MD con- 
tains 2,543,372 bp and has a GC content of 51.65%. A total of 
2,286 protein-encoding genes are predicted, 1,437 of which have 
tentatively been assigned a function. In addition, an ~25-kb plas- 
mid sequence was found that is similar to the four plasmids, pXF- 
RIV11, pXF-RIV16, pXF-RIV19, and pXF-RIV25, present indi- 
vidually in four California mulberry strains of X. fastidiosa (8), as 
well as to the plasmid associated with the grapevine GB514 strain 
of X fastidiosa. 

Nucleotide sequence accession numbers. This whole-genome 
shotgun sequence has been deposited at DDBJ/EMBL/GenBank 
under the accession number AXDP00000000. The version de- 
scribed in this paper is version AXDP01000000. 
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